Deep CNN Object Features for Improved Action Recognition in Low Quality Videos

نویسندگان

  • Saimunur Rahman
  • John See
  • Chiung Ching Ho
چکیده

Video based action recognition has been an active area of research in recent years. This is due to the many potential applications such as video surveillance, video content analysis, human computer interaction and video archiving. The ongoing trend of research deals with many complex action recognition problems such as appearance, pose and illumination variations but problem video quality is still considered unexplored. Under low quality conditions, the major challenge is to develop robust feature representation methods that possess discriminative capacity for modeling actions. Many feature representation methods that have been proposed in recent years can be classified into two types: hand-crafted and deeply-learned features. Among various hand-crafted methods proposed in literature, STIP, Cuboid, Hessian, dense sampling, dense trajectory (DT) and improved dense trajectories (IDT) are popular choices. These methods generally rely Email Address: [email protected] Fig.1. Sample low quality videos form UCF-11 ‘compressed’ version (top row) and HMDB51 ‘bad’ and ‘medium’ quality subsets (bottom row).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Emotion Recognition Using Scalogram Based Deep Structure

Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...

متن کامل

T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos

The state-of-the-art performance for object detection has been significantly improved over the past two years. Besides the introduction of powerful deep neural networks such as GoogleNet [1] and VGG [2], novel object detection frameworks such as R-CNN [3] and its successors, Fast R-CNN [4] and Faster R-CNN [5], play an essential role in improving the state-of-the-art. Despite their effectivenes...

متن کامل

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos

Deep learning has been demonstrated to achieve excellent results for image classification and object detection. However, the impact of deep learning on video analysis (e.g. action detection and recognition) has not been that significant due to complexity of video data and lack of annotations. In addition, training deep neural networks on large scale video datasets is extremely computationally e...

متن کامل

Hand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study

Despite considerable enhances in recognizing hand gestures from still images, there are still many challenges in the classification of hand gestures in videos. The latter comes with more challenges, including higher computational complexity and arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016